NSF-CAREER: The Listening Machine Annual Report 2004

Author

  • Daniel P. W. Ellis
Abstract

This year, we expanded our investigation of sound analysis to look at a range of different sound 'scenes', including overlapping conversations, ambient everyday sound, and music. In each case, the goal is to abstract useful information similar to that which a human listener would perceive, and in particular to deal successfully with the issues raised by multiple, overlapping sound sources. Our most focused effort was the continued development of the novel model for sound sources we proposed last year, based on treating each spectral instant as a simple deformation of its immediate predecessor (or, in general, its neighbors). This model decomposes smoothly-varying segments of sound into a single spectral profile, and a set of locally-smooth transformation functions, describing how the spectral detail is derived from its predecessors. This year, we extended this model to a two-layer version with separate transformations applied to fine spectral structure (e.g. harmonics, to account for changes in pitch) and broader spectral structure (e.g. the formants of voice, which in general will move independently of the harmonics). The key to this model is the way that the parame

Similar resources

NSF-CAREER: The Listening Machine IIS-0238301 Annual Report 2007

We have continued our research into associating words with the soundtracks of recordings of natural environments. We have been working with a database of 1400 “consumer videos” (collected by our collaborators at Kodak) as well as with similar amateur videos downloaded from YouTube. Based on a provisional lexicon of 25 terms that consumers might use as search terms (“music”, “birthday”, “beach”)...

NSF-CAREER: The Listening Machine Annual Report 2005

Continuing our broadened theme of machine listening in many contexts, in 2005 we conducted research into automatic extraction of information in complex sound mixtures, in 'personal audio' environmental recordings, from music audio, and for the sounds of marine mammals recorded underwater. 2005 saw the graduation of Manuel Reyes, the Ph.D. student supported by this project from the start. Manuel...

NSF-CAREER: The Listening Machine IIS-0238301 2003–2008 Final Report

This six-year project started with the idea of applying sound recognition and separation techniques that had originated in speech recognition to a broader domain of environmental sound mixtures. As it proceeded, the work diversified into several distinct areas, reflecting the different directions of the graduate students primarily supported by the project: Manuel Reyes and Keansub Lee worked on...

Speeding up sum-of-squares for tensor decomposition and planted sparse vectors

We consider two problems that arise in machine learning applications: the problem of recovering a planted sparse vector in a random linear subspace and the problem of decomposing a random low-rank overcomplete 3-tensor. For both problems, the best known guarantees are based on the sum-of-squares method. We develop new algorithms inspired by analyses of the sum-of-squares method. Our algorithms achi...

Software in Science: A Report of Outcomes of the 2014 National Science Foundation Software Infrastructure for Sustained Innovation (SI2) Meeting

The second annual NSF Software Infrastructure for Sustained Innovation (SI2) PI meeting took place in Arlington, VA, on February 24-25, 2014. It was hosted by Beth Plale, Indiana University; Douglas Thain, University of Notre Dame; and Matt Jones, National Center for Ecological Analysis and Synthesis. This report captures the challenges and outcomes emerging from the meeting over the four topic area...

Publication date: 2005